Dataset statistics
| Number of variables | 36 |
|---|---|
| Number of observations | 2344823 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 4 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 661.9 MiB |
| Average record size in memory | 296.0 B |
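The two memory figures above agree with each other: the average record size multiplied by the number of observations gives the total. The sketch below checks that arithmetic. The breakdown of a record into 8-byte slots (36 columns plus one index entry) is an assumption that happens to match 296 B; the report does not state the column dtypes.

```python
# Figures copied from the "Dataset statistics" table.
N_OBSERVATIONS = 2_344_823
N_VARIABLES = 36
AVG_RECORD_BYTES = 296.0

MIB = 2 ** 20  # bytes per mebibyte

# Total memory = average record size x number of records.
total_mib = AVG_RECORD_BYTES * N_OBSERVATIONS / MIB

# Assumption (not stated in the report): every value and every index
# entry occupies 8 bytes, so one record is (36 + 1) * 8 bytes.
ASSUMED_BYTES_PER_VALUE = 8
assumed_record_bytes = (N_VARIABLES + 1) * ASSUMED_BYTES_PER_VALUE

print(f"total: {total_mib:.1f} MiB")
print(f"assumed record size: {assumed_record_bytes} B")
```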
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 26 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 4 (< 0.1%) duplicate rows | Duplicates |
| IN_TREINEIRO is highly overall correlated with TP_FAIXA_ETARIA and 1 other field | High correlation |
| Q001 is highly overall correlated with Q002 and 1 other field | High correlation |
| Q002 is highly overall correlated with Q001 and 1 other field | High correlation |
| Q003 is highly overall correlated with Q001 | High correlation |
| Q004 is highly overall correlated with Q002 | High correlation |
| Q006 is highly overall correlated with Q018 | High correlation |
| Q018 is highly overall correlated with Q006 | High correlation |
| TP_ANO_CONCLUIU is highly overall correlated with TP_FAIXA_ETARIA | High correlation |
| TP_ESCOLA is highly overall correlated with TP_ST_CONCLUSAO | High correlation |
| TP_FAIXA_ETARIA is highly overall correlated with IN_TREINEIRO and 1 other field | High correlation |
| TP_ST_CONCLUSAO is highly overall correlated with IN_TREINEIRO and 1 other field | High correlation |
| TP_ESTADO_CIVIL is highly imbalanced (78.1%) | Imbalance |
| TP_NACIONALIDADE is highly imbalanced (92.3%) | Imbalance |
| Q007 is highly imbalanced (70.4%) | Imbalance |
| Q011 is highly imbalanced (58.6%) | Imbalance |
| Q012 is highly imbalanced (80.3%) | Imbalance |
| Q014 is highly imbalanced (56.2%) | Imbalance |
| Q015 is highly imbalanced (73.9%) | Imbalance |
| Q016 is highly imbalanced (54.3%) | Imbalance |
| Q017 is highly imbalanced (89.6%) | Imbalance |
| Q025 is highly imbalanced (59.6%) | Imbalance |
| TP_COR_RACA has 40871 (1.7%) zeros | Zeros |
| TP_ANO_CONCLUIU has 1460782 (62.3%) zeros | Zeros |
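The imbalance percentages in the alerts are not explained in the report itself. One definition that reproduces them is one minus the normalized Shannon entropy of the category counts. The sketch below assumes that definition (the function name `imbalance_score` is hypothetical) and applies it to the TP_ESTADO_CIVIL and Q007 counts listed later in this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the
    category distribution and k is the number of categories."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

# Counts copied from the "Common Values" tables of each variable.
tp_estado_civil = [2164815, 78251, 73063, 26871, 1823]
q007 = [2117906, 122670, 76675, 27572]

print(f"TP_ESTADO_CIVIL: {imbalance_score(tp_estado_civil):.1%}")
print(f"Q007: {imbalance_score(q007):.1%}")
```

A score of 0% corresponds to perfectly even categories and 100% to a single category holding every row, so higher values mean a more dominant mode.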
Reproduction
| Analysis started | 2024-04-14 22:27:07.459058 |
|---|---|
| Analysis finished | 2024-04-14 22:30:03.059784 |
| Duration | 2 minutes and 55.6 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
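The reported duration can be checked directly from the two timestamps. This is a small sketch using only the standard library; the output format string simply mirrors the wording in the table.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-04-14 22:27:07.459058", FMT)
finished = datetime.strptime("2024-04-14 22:30:03.059784", FMT)

# Elapsed wall-clock time in seconds, then split into minutes and seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)

print(f"{int(minutes)} minutes and {seconds:.1f} seconds")
```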
TP_FAIXA_ETARIA
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.2380606 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 12 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.3427183 |
|---|---|
| Coefficient of variation (CV) | 0.78873774 |
| Kurtosis | 2.2625056 |
| Mean | 4.2380606 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.6837301 |
| Sum | 9937502 |
| Variance | 11.173766 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 593355 | 25.3% |
| 2 | 576153 | 24.6% |
| 4 | 269293 | 11.5% |
| 1 | 247749 | 10.6% |
| 5 | 151508 | 6.5% |
| 6 | 95792 | 4.1% |
| 11 | 88159 | 3.8% |
| 7 | 67515 | 2.9% |
| 8 | 49832 | 2.1% |
| 12 | 47083 | 2.0% |
| Other values (10) | 158384 | 6.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 247749 | 10.6% |
| 2 | 576153 | 24.6% |
| 3 | 593355 | 25.3% |
| 4 | 269293 | 11.5% |
| 5 | 151508 | 6.5% |
| 6 | 95792 | 4.1% |
| 7 | 67515 | 2.9% |
| 8 | 49832 | 2.1% |
| 9 | 36867 | 1.6% |
| 10 | 30176 | 1.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 20 | 315 | < 0.1% |
| 19 | 852 | < 0.1% |
| 18 | 2090 | 0.1% |
| 17 | 5149 | 0.2% |
| 16 | 9309 | 0.4% |
| 15 | 15339 | 0.7% |
| 14 | 23948 | 1.0% |
| 13 | 34339 | 1.5% |
| 12 | 47083 | 2.0% |
| 11 | 88159 | 3.8% |
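The quantile statistics for TP_FAIXA_ETARIA can be recovered from the value counts alone, since the tables above list all 20 distinct values. The sketch below uses a nearest-rank definition; that choice is an assumption, as the report does not state which interpolation rule it applies.

```python
import math

# Value -> count for TP_FAIXA_ETARIA, copied from the tables above.
counts = {
    1: 247749, 2: 576153, 3: 593355, 4: 269293, 5: 151508,
    6: 95792, 7: 67515, 8: 49832, 9: 36867, 10: 30176,
    11: 88159, 12: 47083, 13: 34339, 14: 23948, 15: 15339,
    16: 9309, 17: 5149, 18: 2090, 19: 852, 20: 315,
}

def quantile(freq, q):
    """Nearest-rank quantile: the smallest value whose cumulative
    count reaches ceil(q * n)."""
    n = sum(freq.values())
    rank = max(1, math.ceil(q * n))
    cumulative = 0
    for value in sorted(freq):
        cumulative += freq[value]
        if cumulative >= rank:
            return value
    raise ValueError("q must be between 0 and 1")

for label, q in [("5%", 0.05), ("Q1", 0.25), ("median", 0.5),
                 ("Q3", 0.75), ("95%", 0.95)]:
    print(label, quantile(counts, q))
```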
TP_SEXO
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1436668 | 61.3% |
| 1 | 908155 | 38.7% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
TP_ESTADO_CIVIL
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2164815 | 92.3% |
| 2 | 78251 | 3.3% |
| 0 | 73063 | 3.1% |
| 3 | 26871 | 1.1% |
| 4 | 1823 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
TP_COR_RACA
Real number (ℝ)
ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.991332 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 40871 |
| Zeros (%) | 1.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.0184765 |
|---|---|
| Coefficient of variation (CV) | 0.51145492 |
| Kurtosis | -1.3097986 |
| Mean | 1.991332 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.1328009 |
| Sum | 4669321 |
| Variance | 1.0372944 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1026418 | 43.8% |
| 3 | 966698 | 41.2% |
| 2 | 255863 | 10.9% |
| 4 | 43782 | 1.9% |
| 0 | 40871 | 1.7% |
| 5 | 11191 | 0.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 40871 | 1.7% |
| 1 | 1026418 | 43.8% |
| 2 | 255863 | 10.9% |
| 3 | 966698 | 41.2% |
| 4 | 43782 | 1.9% |
| 5 | 11191 | 0.5% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 11191 | 0.5% |
| 4 | 43782 | 1.9% |
| 3 | 966698 | 41.2% |
| 2 | 255863 | 10.9% |
| 1 | 1026418 | 43.8% |
| 0 | 40871 | 1.7% |
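Because TP_COR_RACA has only six distinct values, its descriptive statistics can be recomputed from the value counts. The sketch below does so for the sum, mean, variance, standard deviation and coefficient of variation. Dividing by n - 1 (sample variance) is an assumption; it is the choice that matches the reported variance to the digits shown.

```python
import math

# Value -> count for TP_COR_RACA, copied from the tables above.
counts = {0: 40871, 1: 1026418, 2: 255863, 3: 966698, 4: 43782, 5: 11191}

n = sum(counts.values())
total = sum(value * count for value, count in counts.items())
mean = total / n

# Sample variance: weighted squared deviations divided by n - 1
# (assumed divisor, see the note above).
squared_dev = sum(count * (value - mean) ** 2 for value, count in counts.items())
variance = squared_dev / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(f"sum={total} mean={mean:.6f} variance={variance:.7f}")
print(f"std={std:.7f} cv={cv:.8f}")
```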
TP_NACIONALIDADE
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2290623 | 97.7% |
| 2 | 44504 | 1.9% |
| 4 | 5151 | 0.2% |
| 3 | 3660 | 0.2% |
| 0 | 885 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
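The "Frequency (%)" column shows one decimal place, and very small but non-zero shares appear as "< 0.1%". The sketch below is one rule consistent with the TP_NACIONALIDADE rows; the helper name `format_frequency` and the exact threshold are assumptions, not the profiler's documented behaviour.

```python
def format_frequency(count, total):
    """Format count / total as a one-decimal percentage, using "< 0.1%"
    for non-zero shares that would otherwise round to 0.0%."""
    pct = 100.0 * count / total
    if count > 0 and round(pct, 1) == 0.0:
        return "< 0.1%"
    return f"{pct:.1f}%"

TOTAL = 2_344_823  # number of observations in the dataset

# (value, count) pairs copied from the TP_NACIONALIDADE table.
for value, count in [(1, 2290623), (2, 44504), (4, 5151), (3, 3660), (0, 885)]:
    print(value, count, format_frequency(count, TOTAL))
```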
TP_ST_CONCLUSAO
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 963119 | 41.1% |
| 2 | 957731 | 40.8% |
| 3 | 417070 | 17.8% |
| 4 | 6903 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
TP_ANO_CONCLUIU
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7279923 |
| Minimum | 0 |
|---|---|
| Maximum | 16 |
| Zeros | 1460782 |
| Zeros (%) | 62.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2 |
| 95-th percentile | 11 |
| Maximum | 16 |
| Range | 16 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 3.6107076 |
|---|---|
| Coefficient of variation (CV) | 2.0895392 |
| Kurtosis | 6.9738887 |
| Mean | 1.7279923 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.7217356 |
| Sum | 4051836 |
| Variance | 13.037209 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1460782 | 62.3% |
| 1 | 297575 | 12.7% |
| 2 | 131246 | 5.6% |
| 3 | 99233 | 4.2% |
| 16 | 70485 | 3.0% |
| 4 | 66101 | 2.8% |
| 5 | 49544 | 2.1% |
| 6 | 35287 | 1.5% |
| 7 | 27668 | 1.2% |
| 8 | 22275 | 0.9% |
| Other values (7) | 84627 | 3.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1460782 | 62.3% |
| 1 | 297575 | 12.7% |
| 2 | 131246 | 5.6% |
| 3 | 99233 | 4.2% |
| 4 | 66101 | 2.8% |
| 5 | 49544 | 2.1% |
| 6 | 35287 | 1.5% |
| 7 | 27668 | 1.2% |
| 8 | 22275 | 0.9% |
| 9 | 17891 | 0.8% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 16 | 70485 | 3.0% |
| 15 | 8330 | 0.4% |
| 14 | 8669 | 0.4% |
| 13 | 10182 | 0.4% |
| 12 | 11450 | 0.5% |
| 11 | 12437 | 0.5% |
| 10 | 15668 | 0.7% |
| 9 | 17891 | 0.8% |
| 8 | 22275 | 0.9% |
| 7 | 27668 | 1.2% |
TP_ESCOLA
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1387092 | 59.2% |
| 2 | 760853 | 32.4% |
| 3 | 196878 | 8.4% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
IN_TREINEIRO
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1927753 | 82.2% |
| 1 | 417070 | 17.8% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
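IN_TREINEIRO is flagged as highly correlated with TP_ST_CONCLUSAO, and the count for IN_TREINEIRO = 1 (417070) equals the count for TP_ST_CONCLUSAO = 3. The sketch below shows how a plain, uncorrected Cramér's V would behave if those were exactly the same rows. The cross-tabulation is hypothetical: only its marginal totals come from this report, and which association measure the report actually used for this pair is not stated here.

```python
import math

def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

# HYPOTHETICAL cross-tabulation. Rows: IN_TREINEIRO (0, 1).
# Columns: TP_ST_CONCLUSAO (1, 2, 3, 4). Marginal totals match the report;
# placing all 417070 cases in one cell is an assumption.
table = [
    [963119, 957731, 0, 6903],
    [0, 0, 417070, 0],
]

v = cramers_v(table)
print(f"Cramér's V = {v:.3f}")
```

Under that assumption one variable fully determines the other, which is the extreme case of the association the alert describes.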
TP_LINGUA
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1357622 | 57.9% |
| 1 | 987201 | 42.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Q001
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.5935045 |
| Minimum | 1 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 5 |
| Q3 | 6 |
| 95-th percentile | 8 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.8760278 |
|---|---|
| Coefficient of variation (CV) | 0.40840883 |
| Kurtosis | -0.75607041 |
| Mean | 4.5935045 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.03745095 |
| Sum | 10770955 |
| Variance | 3.5194803 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 720913 | 30.7% |
| 2 | 350284 | 14.9% |
| 3 | 291859 | 12.4% |
| 4 | 259673 | 11.1% |
| 6 | 250540 | 10.7% |
| 8 | 202637 | 8.6% |
| 7 | 193050 | 8.2% |
| 1 | 75867 | 3.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 75867 | 3.2% |
| 2 | 350284 | 14.9% |
| 3 | 291859 | 12.4% |
| 4 | 259673 | 11.1% |
| 5 | 720913 | 30.7% |
| 6 | 250540 | 10.7% |
| 7 | 193050 | 8.2% |
| 8 | 202637 | 8.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 8 | 202637 | 8.6% |
| 7 | 193050 | 8.2% |
| 6 | 250540 | 10.7% |
| 5 | 720913 | 30.7% |
| 4 | 259673 | 11.1% |
| 3 | 291859 | 12.4% |
| 2 | 350284 | 14.9% |
| 1 | 75867 | 3.2% |
Q002
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.8018234 |
| Minimum | 1 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 5 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.6215216 |
|---|---|
| Coefficient of variation (CV) | 0.33768873 |
| Kurtosis | -0.43221959 |
| Mean | 4.8018234 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.32283434 |
| Sum | 11259426 |
| Variance | 2.6293324 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 851162 | 36.3% |
| 6 | 329384 | 14.0% |
| 7 | 322603 | 13.8% |
| 4 | 262334 | 11.2% |
| 2 | 238683 | 10.2% |
| 3 | 232165 | 9.9% |
| 8 | 62486 | 2.7% |
| 1 | 46006 | 2.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 46006 | 2.0% |
| 2 | 238683 | 10.2% |
| 3 | 232165 | 9.9% |
| 4 | 262334 | 11.2% |
| 5 | 851162 | 36.3% |
| 6 | 329384 | 14.0% |
| 7 | 322603 | 13.8% |
| 8 | 62486 | 2.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 8 | 62486 | 2.7% |
| 7 | 322603 | 13.8% |
| 6 | 329384 | 14.0% |
| 5 | 851162 | 36.3% |
| 4 | 262334 | 11.2% |
| 3 | 232165 | 9.9% |
| 2 | 238683 | 10.2% |
| 1 | 46006 | 2.0% |
Q003
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2131082 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.5404451 |
|---|---|
| Coefficient of variation (CV) | 0.47942521 |
| Kurtosis | -0.86074084 |
| Mean | 3.2131082 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.24137501 |
| Sum | 7534170 |
| Variance | 2.372971 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 531108 | 22.7% |
| 4 | 525019 | 22.4% |
| 2 | 437711 | 18.7% |
| 1 | 387595 | 16.5% |
| 6 | 260803 | 11.1% |
| 5 | 202587 | 8.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 387595 | 16.5% |
| 2 | 437711 | 18.7% |
| 3 | 531108 | 22.7% |
| 4 | 525019 | 22.4% |
| 5 | 202587 | 8.6% |
| 6 | 260803 | 11.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 260803 | 11.1% |
| 5 | 202587 | 8.6% |
| 4 | 525019 | 22.4% |
| 3 | 531108 | 22.7% |
| 2 | 437711 | 18.7% |
| 1 | 387595 | 16.5% |
Q004
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.0045539 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4813366 |
|---|---|
| Coefficient of variation (CV) | 0.49303048 |
| Kurtosis | -0.81308274 |
| Mean | 3.0045539 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.48345606 |
| Sum | 7045147 |
| Variance | 2.1943582 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 902602 | 38.5% |
| 4 | 645302 | 27.5% |
| 1 | 309181 | 13.2% |
| 6 | 196040 | 8.4% |
| 5 | 149110 | 6.4% |
| 3 | 142588 | 6.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 309181 | 13.2% |
| 2 | 902602 | 38.5% |
| 3 | 142588 | 6.1% |
| 4 | 645302 | 27.5% |
| 5 | 149110 | 6.4% |
| 6 | 196040 | 8.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 196040 | 8.4% |
| 5 | 149110 | 6.4% |
| 4 | 645302 | 27.5% |
| 3 | 142588 | 6.1% |
| 2 | 902602 | 38.5% |
| 1 | 309181 | 13.2% |
Q005
Real number (ℝ)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.7702313 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.3073389 |
|---|---|
| Coefficient of variation (CV) | 0.34675297 |
| Kurtosis | 5.5508113 |
| Mean | 3.7702313 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.1113839 |
| Sum | 8840525 |
| Variance | 1.7091349 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 822269 | 35.1% |
| 3 | 660095 | 28.2% |
| 5 | 352232 | 15.0% |
| 2 | 284229 | 12.1% |
| 6 | 111907 | 4.8% |
| 1 | 47123 | 2.0% |
| 7 | 39124 | 1.7% |
| 8 | 15733 | 0.7% |
| 9 | 5868 | 0.3% |
| 10 | 3316 | 0.1% |
| Other values (10) | 2927 | 0.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 47123 | 2.0% |
| 2 | 284229 | 12.1% |
| 3 | 660095 | 28.2% |
| 4 | 822269 | 35.1% |
| 5 | 352232 | 15.0% |
| 6 | 111907 | 4.8% |
| 7 | 39124 | 1.7% |
| 8 | 15733 | 0.7% |
| 9 | 5868 | 0.3% |
| 10 | 3316 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 20 | 172 | < 0.1% |
| 19 | 27 | < 0.1% |
| 18 | 24 | < 0.1% |
| 17 | 43 | < 0.1% |
| 16 | 50 | < 0.1% |
| 15 | 159 | < 0.1% |
| 14 | 190 | < 0.1% |
| 13 | 323 | < 0.1% |
| 12 | 788 | < 0.1% |
| 11 | 1151 | < 0.1% |
Q006
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.0370689 |
| Minimum | 1 |
|---|---|
| Maximum | 17 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 7 |
| 95-th percentile | 14 |
| Maximum | 17 |
| Range | 16 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.7950422 |
|---|---|
| Coefficient of variation (CV) | 0.75342273 |
| Kurtosis | 1.4457602 |
| Mean | 5.0370689 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.4291988 |
| Sum | 11811035 |
| Variance | 14.402345 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 630492 | 26.9% |
| 3 | 369704 | 15.8% |
| 4 | 276804 | 11.8% |
| 5 | 194527 | 8.3% |
| 8 | 145866 | 6.2% |
| 7 | 145812 | 6.2% |
| 1 | 119268 | 5.1% |
| 6 | 115203 | 4.9% |
| 9 | 62344 | 2.7% |
| 10 | 44115 | 1.9% |
| Other values (7) | 240688 | 10.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 119268 | 5.1% |
| 2 | 630492 | 26.9% |
| 3 | 369704 | 15.8% |
| 4 | 276804 | 11.8% |
| 5 | 194527 | 8.3% |
| 6 | 115203 | 4.9% |
| 7 | 145812 | 6.2% |
| 8 | 145866 | 6.2% |
| 9 | 62344 | 2.7% |
| 10 | 44115 | 1.9% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 17 | 39060 | 1.7% |
| 16 | 29282 | 1.2% |
| 15 | 31826 | 1.4% |
| 14 | 28365 | 1.2% |
| 13 | 39300 | 1.7% |
| 12 | 41407 | 1.8% |
| 11 | 31448 | 1.3% |
| 10 | 44115 | 1.9% |
| 9 | 62344 | 2.7% |
| 8 | 145866 | 6.2% |
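The median absolute deviation (MAD) reported for Q006 can also be derived from the value counts: take the median, fold every value into its absolute distance from that median, and take the median of those distances. The sketch below uses a nearest-rank median, which is an assumption about the tie-breaking rule rather than something the report specifies.

```python
import math

# Value -> count for Q006, copied from the tables above.
counts = {
    1: 119268, 2: 630492, 3: 369704, 4: 276804, 5: 194527, 6: 115203,
    7: 145812, 8: 145866, 9: 62344, 10: 44115, 11: 31448, 12: 41407,
    13: 39300, 14: 28365, 15: 31826, 16: 29282, 17: 39060,
}

def weighted_median(freq):
    """Nearest-rank median of a value -> count mapping."""
    n = sum(freq.values())
    rank = math.ceil(n / 2)
    cumulative = 0
    for value in sorted(freq):
        cumulative += freq[value]
        if cumulative >= rank:
            return value
    raise ValueError("empty frequency table")

median = weighted_median(counts)

# Fold the distribution around the median: |value - median| -> count.
deviations = {}
for value, count in counts.items():
    distance = abs(value - median)
    deviations[distance] = deviations.get(distance, 0) + count

mad = weighted_median(deviations)
print(f"median={median} MAD={mad}")
```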
Q007
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2117906 | 90.3% |
| 2 | 122670 | 5.2% |
| 4 | 76675 | 3.3% |
| 3 | 27572 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Q008
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 3 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1417203 | 60.4% |
| 3 | 606569 | 25.9% |
| 4 | 197448 | 8.4% |
| 5 | 109713 | 4.7% |
| 1 | 13890 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Q009
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 4 |
| 3rd row | 3 |
| 4th row | 2 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 1130738 | 48.2% |
| 4 | 823909 | 35.1% |
| 2 | 227697 | 9.7% |
| 5 | 149202 | 6.4% |
| 1 | 13277 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Q010
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 1 | 1100765 |
|---|---|
| 2 | 959173 |
| 3 | 249117 |
| 4 | 29488 |
| 5 | 6280 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1100765 | 46.9% |
| 2 | 959173 | 40.9% |
| 3 | 249117 | 10.6% |
| 4 | 29488 | 1.3% |
| 5 | 6280 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1100765 | 46.9% |
| 2 | 959173 | 40.9% |
| 3 | 249117 | 10.6% |
| 4 | 29488 | 1.3% |
| 5 | 6280 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1100765 | 46.9% |
| 2 | 959173 | 40.9% |
| 3 | 249117 | 10.6% |
| 4 | 29488 | 1.3% |
| 5 | 6280 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1100765 | 46.9% |
| 2 | 959173 | 40.9% |
| 3 | 249117 | 10.6% |
| 4 | 29488 | 1.3% |
| 5 | 6280 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1100765 | 46.9% |
| 2 | 959173 | 40.9% |
| 3 | 249117 | 10.6% |
| 4 | 29488 | 1.3% |
| 5 | 6280 | 0.3% |
Q011
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 1 | 1752408 |
|---|---|
| 2 | 524529 |
| 3 | 61037 |
| 4 | 5759 |
| 5 | 1090 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1752408 | 74.7% |
| 2 | 524529 | 22.4% |
| 3 | 61037 | 2.6% |
| 4 | 5759 | 0.2% |
| 5 | 1090 | < 0.1% |
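The IMBALANCE flag on this variable can be reproduced from the counts above. The sketch below assumes the imbalance score is one minus the Shannon entropy of the value distribution normalised by the logarithm of the number of distinct values; that definition is an assumption about the profiling tool, not something stated in this report, and `imbalance_score` is an illustrative name.

```python
import math

# Q011 counts copied from the Common Values table above.
counts = [1752408, 524529, 61037, 5759, 1090]


def imbalance_score(counts):
    """Return 1 - H / log(k): 0 for a uniform column, 1 for a single-valued one."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log(p) for p in probs)  # Shannon entropy in nats
    k = len(counts)                                 # number of distinct values
    return 1.0 - entropy / math.log(k) if k > 1 else 0.0


score = imbalance_score(counts)
print(f"{score:.1%}")
```

Under that assumption the score for Q011 comes out at roughly 58.6%, which agrees with the imbalance alert listed for Q011 in the report overview.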
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1752408 | 74.7% |
| 2 | 524529 | 22.4% |
| 3 | 61037 | 2.6% |
| 4 | 5759 | 0.2% |
| 5 | 1090 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1752408 | 74.7% |
| 2 | 524529 | 22.4% |
| 3 | 61037 | 2.6% |
| 4 | 5759 | 0.2% |
| 5 | 1090 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1752408 | 74.7% |
| 2 | 524529 | 22.4% |
| 3 | 61037 | 2.6% |
| 4 | 5759 | 0.2% |
| 5 | 1090 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1752408 | 74.7% |
| 2 | 524529 | 22.4% |
| 3 | 61037 | 2.6% |
| 4 | 5759 | 0.2% |
| 5 | 1090 | < 0.1% |
Q012
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 2 | 2170790 |
|---|---|
| 3 | 134253 |
| 1 | 27741 |
| 4 | 10098 |
| 5 | 1941 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 3 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 2170790 | 92.6% |
| 3 | 134253 | 5.7% |
| 1 | 27741 | 1.2% |
| 4 | 10098 | 0.4% |
| 5 | 1941 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 2170790 | 92.6% |
| 3 | 134253 | 5.7% |
| 1 | 27741 | 1.2% |
| 4 | 10098 | 0.4% |
| 5 | 1941 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 2170790 | 92.6% |
| 3 | 134253 | 5.7% |
| 1 | 27741 | 1.2% |
| 4 | 10098 | 0.4% |
| 5 | 1941 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 2170790 | 92.6% |
| 3 | 134253 | 5.7% |
| 1 | 27741 | 1.2% |
| 4 | 10098 | 0.4% |
| 5 | 1941 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 2170790 | 92.6% |
| 3 | 134253 | 5.7% |
| 1 | 27741 | 1.2% |
| 4 | 10098 | 0.4% |
| 5 | 1941 | 0.1% |
Q013
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 1 | 1172567 |
|---|---|
| 2 | 1070208 |
| 3 | 88886 |
| 4 | 10781 |
| 5 | 2381 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1172567 | 50.0% |
| 2 | 1070208 | 45.6% |
| 3 | 88886 | 3.8% |
| 4 | 10781 | 0.5% |
| 5 | 2381 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1172567 | 50.0% |
| 2 | 1070208 | 45.6% |
| 3 | 88886 | 3.8% |
| 4 | 10781 | 0.5% |
| 5 | 2381 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1172567 | 50.0% |
| 2 | 1070208 | 45.6% |
| 3 | 88886 | 3.8% |
| 4 | 10781 | 0.5% |
| 5 | 2381 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1172567 | 50.0% |
| 2 | 1070208 | 45.6% |
| 3 | 88886 | 3.8% |
| 4 | 10781 | 0.5% |
| 5 | 2381 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1172567 | 50.0% |
| 2 | 1070208 | 45.6% |
| 3 | 88886 | 3.8% |
| 4 | 10781 | 0.5% |
| 5 | 2381 | 0.1% |
Q014
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 2 | 1530846 |
|---|---|
| 1 | 782921 |
| 3 | 29821 |
| 4 | 1032 |
| 5 | 203 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1530846 | 65.3% |
| 1 | 782921 | 33.4% |
| 3 | 29821 | 1.3% |
| 4 | 1032 | < 0.1% |
| 5 | 203 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1530846 | 65.3% |
| 1 | 782921 | 33.4% |
| 3 | 29821 | 1.3% |
| 4 | 1032 | < 0.1% |
| 5 | 203 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1530846 | 65.3% |
| 1 | 782921 | 33.4% |
| 3 | 29821 | 1.3% |
| 4 | 1032 | < 0.1% |
| 5 | 203 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1530846 | 65.3% |
| 1 | 782921 | 33.4% |
| 3 | 29821 | 1.3% |
| 4 | 1032 | < 0.1% |
| 5 | 203 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1530846 | 65.3% |
| 1 | 782921 | 33.4% |
| 3 | 29821 | 1.3% |
| 4 | 1032 | < 0.1% |
| 5 | 203 | < 0.1% |
Q015
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 1 | 2010385 |
|---|---|
| 2 | 330081 |
| 3 | 4012 |
| 4 | 236 |
| 5 | 109 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2010385 | 85.7% |
| 2 | 330081 | 14.1% |
| 3 | 4012 | 0.2% |
| 4 | 236 | < 0.1% |
| 5 | 109 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2010385 | 85.7% |
| 2 | 330081 | 14.1% |
| 3 | 4012 | 0.2% |
| 4 | 236 | < 0.1% |
| 5 | 109 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2010385 | 85.7% |
| 2 | 330081 | 14.1% |
| 3 | 4012 | 0.2% |
| 4 | 236 | < 0.1% |
| 5 | 109 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2010385 | 85.7% |
| 2 | 330081 | 14.1% |
| 3 | 4012 | 0.2% |
| 4 | 236 | < 0.1% |
| 5 | 109 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2010385 | 85.7% |
| 2 | 330081 | 14.1% |
| 3 | 4012 | 0.2% |
| 4 | 236 | < 0.1% |
| 5 | 109 | < 0.1% |
Q016
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 2 | 1239159 |
|---|---|
| 1 | 1085694 |
| 3 | 18910 |
| 4 | 821 |
| 5 | 239 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1239159 | 52.8% |
| 1 | 1085694 | 46.3% |
| 3 | 18910 | 0.8% |
| 4 | 821 | < 0.1% |
| 5 | 239 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1239159 | 52.8% |
| 1 | 1085694 | 46.3% |
| 3 | 18910 | 0.8% |
| 4 | 821 | < 0.1% |
| 5 | 239 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1239159 | 52.8% |
| 1 | 1085694 | 46.3% |
| 3 | 18910 | 0.8% |
| 4 | 821 | < 0.1% |
| 5 | 239 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1239159 | 52.8% |
| 1 | 1085694 | 46.3% |
| 3 | 18910 | 0.8% |
| 4 | 821 | < 0.1% |
| 5 | 239 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1239159 | 52.8% |
| 1 | 1085694 | 46.3% |
| 3 | 18910 | 0.8% |
| 4 | 821 | < 0.1% |
| 5 | 239 | < 0.1% |
Q017
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 1 | 2254762 |
|---|---|
| 2 | 88434 |
| 3 | 1382 |
| 4 | 137 |
| 5 | 108 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2254762 | 96.2% |
| 2 | 88434 | 3.8% |
| 3 | 1382 | 0.1% |
| 4 | 137 | < 0.1% |
| 5 | 108 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2254762 | 96.2% |
| 2 | 88434 | 3.8% |
| 3 | 1382 | 0.1% |
| 4 | 137 | < 0.1% |
| 5 | 108 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2254762 | 96.2% |
| 2 | 88434 | 3.8% |
| 3 | 1382 | 0.1% |
| 4 | 137 | < 0.1% |
| 5 | 108 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2254762 | 96.2% |
| 2 | 88434 | 3.8% |
| 3 | 1382 | 0.1% |
| 4 | 137 | < 0.1% |
| 5 | 108 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2254762 | 96.2% |
| 2 | 88434 | 3.8% |
| 3 | 1382 | 0.1% |
| 4 | 137 | < 0.1% |
| 5 | 108 | < 0.1% |
Q018
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 1 | 1695096 |
|---|---|
| 2 | 649727 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1695096 | 72.3% |
| 2 | 649727 | 27.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1695096 | 72.3% |
| 2 | 649727 | 27.7% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1695096 | 72.3% |
| 2 | 649727 | 27.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1695096 | 72.3% |
| 2 | 649727 | 27.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1695096 | 72.3% |
| 2 | 649727 | 27.7% |
Q019
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 2 | 1458493 |
|---|---|
| 3 | 480748 |
| 4 | 188222 |
| 1 | 123290 |
| 5 | 94070 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1458493 | 62.2% |
| 3 | 480748 | 20.5% |
| 4 | 188222 | 8.0% |
| 1 | 123290 | 5.3% |
| 5 | 94070 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1458493 | 62.2% |
| 3 | 480748 | 20.5% |
| 4 | 188222 | 8.0% |
| 1 | 123290 | 5.3% |
| 5 | 94070 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1458493 | 62.2% |
| 3 | 480748 | 20.5% |
| 4 | 188222 | 8.0% |
| 1 | 123290 | 5.3% |
| 5 | 94070 | 4.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1458493 | 62.2% |
| 3 | 480748 | 20.5% |
| 4 | 188222 | 8.0% |
| 1 | 123290 | 5.3% |
| 5 | 94070 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1458493 | 62.2% |
| 3 | 480748 | 20.5% |
| 4 | 188222 | 8.0% |
| 1 | 123290 | 5.3% |
| 5 | 94070 | 4.0% |
Q020
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 1 | 1912069 |
|---|---|
| 2 | 432754 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1912069 | 81.5% |
| 2 | 432754 | 18.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1912069 | 81.5% |
| 2 | 432754 | 18.5% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1912069 | 81.5% |
| 2 | 432754 | 18.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1912069 | 81.5% |
| 2 | 432754 | 18.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1912069 | 81.5% |
| 2 | 432754 | 18.5% |
Q021
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 1 | 1758480 |
|---|---|
| 2 | 586343 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1758480 | 75.0% |
| 2 | 586343 | 25.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1758480 | 75.0% |
| 2 | 586343 | 25.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1758480 | 75.0% |
| 2 | 586343 | 25.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1758480 | 75.0% |
| 2 | 586343 | 25.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1758480 | 75.0% |
| 2 | 586343 | 25.0% |
Q022
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 4 | 757635 |
|---|---|
| 5 | 601948 |
| 3 | 599482 |
| 2 | 337864 |
| 1 | 47894 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 5 |
| 4th row | 2 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 757635 | 32.3% |
| 5 | 601948 | 25.7% |
| 3 | 599482 | 25.6% |
| 2 | 337864 | 14.4% |
| 1 | 47894 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 757635 | 32.3% |
| 5 | 601948 | 25.7% |
| 3 | 599482 | 25.6% |
| 2 | 337864 | 14.4% |
| 1 | 47894 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 757635 | 32.3% |
| 5 | 601948 | 25.7% |
| 3 | 599482 | 25.6% |
| 2 | 337864 | 14.4% |
| 1 | 47894 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 757635 | 32.3% |
| 5 | 601948 | 25.7% |
| 3 | 599482 | 25.6% |
| 2 | 337864 | 14.4% |
| 1 | 47894 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 757635 | 32.3% |
| 5 | 601948 | 25.7% |
| 3 | 599482 | 25.6% |
| 2 | 337864 | 14.4% |
| 1 | 47894 | 2.0% |
Q023
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 1 | 2023620 |
|---|---|
| 2 | 321203 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2023620 | 86.3% |
| 2 | 321203 | 13.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2023620 | 86.3% |
| 2 | 321203 | 13.7% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2023620 | 86.3% |
| 2 | 321203 | 13.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2023620 | 86.3% |
| 2 | 321203 | 13.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2023620 | 86.3% |
| 2 | 321203 | 13.7% |
Q024
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 1 | 988546 |
|---|---|
| 2 | 932055 |
| 3 | 267466 |
| 4 | 104512 |
| 5 | 52244 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 988546 | 42.2% |
| 2 | 932055 | 39.7% |
| 3 | 267466 | 11.4% |
| 4 | 104512 | 4.5% |
| 5 | 52244 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 988546 | 42.2% |
| 2 | 932055 | 39.7% |
| 3 | 267466 | 11.4% |
| 4 | 104512 | 4.5% |
| 5 | 52244 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 988546 | 42.2% |
| 2 | 932055 | 39.7% |
| 3 | 267466 | 11.4% |
| 4 | 104512 | 4.5% |
| 5 | 52244 | 2.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 988546 | 42.2% |
| 2 | 932055 | 39.7% |
| 3 | 267466 | 11.4% |
| 4 | 104512 | 4.5% |
| 5 | 52244 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 988546 | 42.2% |
| 2 | 932055 | 39.7% |
| 3 | 267466 | 11.4% |
| 4 | 104512 | 4.5% |
| 5 | 52244 | 2.2% |
Q025
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.8 MiB |
| 2 | 2156270 |
|---|---|
| 1 | 188553 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2344823 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 2156270 | 92.0% |
| 1 | 188553 | 8.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 2156270 | 92.0% |
| 1 | 188553 | 8.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2344823 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 2156270 | 92.0% |
| 1 | 188553 | 8.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2344823 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 2156270 | 92.0% |
| 1 | 188553 | 8.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2344823 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 2156270 | 92.0% |
| 1 | 188553 | 8.0% |
MEDIAS
Real number (ℝ)
| Distinct | 50309 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 543.48471 |
| Minimum | 0 |
|---|---|
| Maximum | 855.98 |
| Zeros | 22 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 35.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 402.08 |
| Q1 | 484.52 |
| median | 540.54 |
| Q3 | 602.06 |
| 95-th percentile | 693.14 |
| Maximum | 855.98 |
| Range | 855.98 |
| Interquartile range (IQR) | 117.54 |
Descriptive statistics
| Standard deviation | 88.04023 |
|---|---|
| Coefficient of variation (CV) | 0.1619921 |
| Kurtosis | -0.016444533 |
| Mean | 543.48471 |
| Median Absolute Deviation (MAD) | 58.58 |
| Skewness | 0.034281316 |
| Sum | 1.2743754 × 10⁹ |
| Variance | 7751.0822 |
| Monotonicity | Not monotonic |
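Several of the MEDIAS statistics above are derived from one another, so they can be sanity-checked with a few lines of arithmetic. The sketch below uses only numbers copied from the tables; the variable names are illustrative, and small differences in the last digit are expected because the report shows rounded inputs.

```python
# Values copied from the MEDIAS statistics tables above.
n_obs = 2344823
mean, std = 543.48471, 88.04023
q1, q3 = 484.52, 602.06
minimum, maximum = 0.0, 855.98

iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum   # range
cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance is the squared standard deviation
total = mean * n_obs              # sum of all values

print(f"IQR={iqr:.2f} range={value_range:.2f} CV={cv:.7f}")
print(f"variance={variance:.4f} sum={total:.7e}")
```

Each derived value should land close to the corresponding table entry (IQR 117.54, CV 0.1619921, variance 7751.0822, sum about 1.2743754 × 10⁹).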
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 526.72 | 261 | < 0.1% |
| 557.2 | 259 | < 0.1% |
| 534.74 | 255 | < 0.1% |
| 531.5 | 254 | < 0.1% |
| 530.78 | 253 | < 0.1% |
| 513.4 | 251 | < 0.1% |
| 514.72 | 249 | < 0.1% |
| 529.5 | 249 | < 0.1% |
| 512.76 | 247 | < 0.1% |
| 530.6 | 246 | < 0.1% |
| Other values (50299) | 2342299 | 99.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 22 | < 0.1% |
| 56.14 | 1 | < 0.1% |
| 64 | 1 | < 0.1% |
| 66.1 | 1 | < 0.1% |
| 69.8 | 1 | < 0.1% |
| 72.12 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
| 82.28 | 1 | < 0.1% |
| 89.12 | 1 | < 0.1% |
| 92 | 1 | < 0.1% |
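Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 855.98 | 1 | < 0.1% |
| 855.82 | 1 | < 0.1% |
| 851.84 | 1 | < 0.1% |
| 849.86 | 1 | < 0.1% |
| 848.32 | 1 | < 0.1% |
| 843.5 | 1 | < 0.1% |
| 842.02 | 1 | < 0.1% |
| 841.98 | 1 | < 0.1% |
| 841.76 | 1 | < 0.1% |
| 841.1 | 1 | < 0.1% |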
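The HIGH CORRELATION alerts in this report can be cross-checked against the correlation matrix that follows. The sketch below copies a handful of coefficients from that matrix and flags pairs whose absolute value reaches a cutoff. The 0.5 cutoff and the `flag_high` helper are assumptions for illustration; the report itself does not state the threshold it uses.

```python
# Coefficients copied from the correlation matrix in this report.
pairs = {
    ("Q001", "Q002"): 0.501,
    ("Q001", "Q003"): 0.502,
    ("Q002", "Q004"): 0.519,
    ("Q003", "Q004"): 0.497,
    ("Q006", "Q018"): 0.528,
    ("Q006", "MEDIAS"): 0.463,
    ("IN_TREINEIRO", "TP_FAIXA_ETARIA"): -0.582,
}

THRESHOLD = 0.5  # assumed cutoff, not stated in the report


def flag_high(pairs, threshold=THRESHOLD):
    """Return the variable pairs whose |coefficient| is at or above the threshold."""
    return sorted(pair for pair, r in pairs.items() if abs(r) >= threshold)


flagged = flag_high(pairs)
for a, b in flagged:
    print(f"{a} is highly correlated with {b}")
```

With this cutoff, Q003 and Q004 (0.497) fall just short, which is consistent with the overview alerts listing Q003 only against Q001 and Q004 only against Q002.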
| | IN_TREINEIRO | MEDIAS | Q001 | Q002 | Q003 | Q004 | Q005 | Q006 | Q007 | Q008 | Q009 | Q010 | Q011 | Q012 | Q013 | Q014 | Q015 | Q016 | Q017 | Q018 | Q019 | Q020 | Q021 | Q022 | Q023 | Q024 | Q025 | TP_ANO_CONCLUIU | TP_COR_RACA | TP_ESCOLA | TP_ESTADO_CIVIL | TP_FAIXA_ETARIA | TP_LINGUA | TP_NACIONALIDADE | TP_SEXO | TP_ST_CONCLUSAO |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| IN_TREINEIRO | 1.000 | 0.012 | 0.116 | 0.160 | 0.087 | 0.113 | 0.030 | 0.159 | 0.117 | 0.163 | 0.125 | 0.155 | 0.028 | 0.068 | 0.085 | 0.081 | 0.044 | 0.093 | 0.062 | 0.107 | 0.131 | 0.069 | 0.113 | 0.102 | 0.044 | 0.130 | 0.052 | -0.349 | -0.068 | 0.386 | 0.084 | -0.582 | 0.109 | 0.014 | 0.037 | 1.000 |
| MEDIAS | 0.012 | 1.000 | 0.256 | 0.315 | 0.248 | 0.278 | -0.068 | 0.463 | 0.125 | 0.179 | 0.110 | 0.162 | 0.034 | 0.065 | 0.130 | 0.122 | 0.059 | 0.129 | 0.077 | 0.289 | 0.148 | 0.129 | 0.197 | 0.125 | 0.164 | 0.221 | 0.188 | 0.076 | -0.228 | 0.192 | 0.038 | -0.073 | 0.277 | 0.033 | 0.060 | 0.064 |
| Q001 | 0.116 | 0.256 | 1.000 | 0.501 | 0.502 | 0.357 | -0.022 | 0.375 | 0.183 | 0.240 | 0.154 | 0.214 | 0.046 | 0.096 | 0.140 | 0.170 | 0.090 | 0.161 | 0.120 | 0.366 | 0.197 | 0.174 | 0.312 | 0.166 | 0.202 | 0.262 | 0.211 | -0.142 | -0.160 | 0.204 | 0.077 | -0.218 | 0.260 | 0.020 | 0.066 | 0.124 |
| Q002 | 0.160 | 0.315 | 0.501 | 1.000 | 0.349 | 0.519 | -0.017 | 0.466 | 0.166 | 0.216 | 0.149 | 0.203 | 0.022 | 0.086 | 0.125 | 0.156 | 0.077 | 0.147 | 0.094 | 0.317 | 0.173 | 0.135 | 0.278 | 0.154 | 0.166 | 0.236 | 0.212 | -0.165 | -0.177 | 0.191 | 0.098 | -0.283 | 0.236 | 0.019 | 0.066 | 0.134 |
| Q003 | 0.087 | 0.248 | 0.502 | 0.349 | 1.000 | 0.497 | -0.045 | 0.387 | 0.229 | 0.276 | 0.170 | 0.240 | 0.075 | 0.115 | 0.157 | 0.191 | 0.102 | 0.182 | 0.137 | 0.385 | 0.227 | 0.187 | 0.343 | 0.179 | 0.216 | 0.272 | 0.237 | -0.096 | -0.165 | 0.219 | 0.043 | -0.152 | 0.267 | 0.021 | 0.063 | 0.103 |
| Q004 | 0.113 | 0.278 | 0.357 | 0.519 | 0.497 | 1.000 | -0.022 | 0.464 | 0.227 | 0.256 | 0.159 | 0.224 | 0.062 | 0.106 | 0.147 | 0.191 | 0.094 | 0.176 | 0.124 | 0.351 | 0.207 | 0.163 | 0.319 | 0.168 | 0.196 | 0.259 | 0.244 | -0.116 | -0.183 | 0.201 | 0.048 | -0.194 | 0.249 | 0.021 | 0.065 | 0.107 |
| Q005 | 0.030 | -0.068 | -0.022 | -0.017 | -0.045 | -0.022 | 1.000 | 0.049 | 0.039 | 0.073 | 0.159 | 0.107 | 0.053 | 0.053 | 0.052 | 0.059 | 0.043 | 0.067 | 0.040 | 0.119 | 0.082 | 0.067 | 0.086 | 0.233 | 0.066 | 0.078 | 0.088 | -0.112 | 0.065 | 0.081 | 0.051 | -0.111 | 0.073 | 0.010 | 0.018 | 0.090 |
| Q006 | 0.159 | 0.463 | 0.375 | 0.466 | 0.387 | 0.464 | 0.049 | 1.000 | 0.307 | 0.366 | 0.249 | 0.349 | 0.051 | 0.161 | 0.219 | 0.257 | 0.141 | 0.242 | 0.180 | 0.528 | 0.310 | 0.253 | 0.450 | 0.260 | 0.279 | 0.375 | 0.308 | -0.090 | -0.290 | 0.252 | 0.021 | -0.217 | 0.299 | 0.027 | 0.098 | 0.111 |
| Q007 | 0.117 | 0.125 | 0.183 | 0.166 | 0.229 | 0.227 | 0.039 | 0.307 | 1.000 | 0.288 | 0.181 | 0.218 | 0.033 | 0.145 | 0.145 | 0.103 | 0.102 | 0.126 | 0.158 | 0.249 | 0.223 | 0.161 | 0.277 | 0.111 | 0.148 | 0.220 | 0.066 | -0.093 | -0.135 | 0.159 | 0.017 | -0.141 | 0.140 | 0.011 | 0.036 | 0.076 |
| Q008 | 0.163 | 0.179 | 0.240 | 0.216 | 0.276 | 0.256 | 0.073 | 0.366 | 0.288 | 1.000 | 0.343 | 0.306 | 0.042 | 0.226 | 0.212 | 0.205 | 0.121 | 0.208 | 0.168 | 0.444 | 0.318 | 0.237 | 0.382 | 0.224 | 0.245 | 0.301 | 0.247 | -0.120 | -0.222 | 0.213 | 0.027 | -0.217 | 0.236 | 0.020 | 0.062 | 0.107 |
| Q009 | 0.125 | 0.110 | 0.154 | 0.149 | 0.170 | 0.159 | 0.159 | 0.249 | 0.181 | 0.343 | 1.000 | 0.242 | 0.066 | 0.185 | 0.168 | 0.167 | 0.093 | 0.159 | 0.096 | 0.305 | 0.240 | 0.181 | 0.279 | 0.271 | 0.172 | 0.214 | 0.262 | -0.126 | -0.153 | 0.121 | 0.051 | -0.201 | 0.153 | 0.012 | 0.043 | 0.095 |
| Q010 | 0.155 | 0.162 | 0.214 | 0.203 | 0.240 | 0.224 | 0.107 | 0.349 | 0.218 | 0.306 | 0.242 | 1.000 | 0.055 | 0.177 | 0.211 | 0.240 | 0.128 | 0.231 | 0.131 | 0.479 | 0.267 | 0.211 | 0.359 | 0.240 | 0.239 | 0.281 | 0.260 | -0.149 | -0.262 | 0.175 | 0.024 | -0.247 | 0.233 | 0.018 | 0.060 | 0.113 |
| Q011 | 0.028 | 0.034 | 0.046 | 0.022 | 0.075 | 0.062 | 0.053 | 0.051 | 0.033 | 0.042 | 0.066 | 0.055 | 1.000 | 0.043 | 0.039 | 0.064 | 0.079 | 0.058 | 0.075 | 0.061 | 0.038 | 0.034 | 0.049 | 0.063 | 0.068 | 0.039 | 0.055 | -0.041 | 0.054 | 0.053 | 0.012 | -0.049 | 0.072 | 0.008 | 0.033 | 0.028 |
| Q012 | 0.068 | 0.065 | 0.096 | 0.086 | 0.115 | 0.106 | 0.053 | 0.161 | 0.145 | 0.226 | 0.185 | 0.177 | 0.043 | 1.000 | 0.310 | 0.155 | 0.111 | 0.187 | 0.116 | 0.232 | 0.207 | 0.159 | 0.207 | 0.134 | 0.140 | 0.145 | 0.184 | -0.073 | -0.112 | 0.086 | 0.015 | -0.108 | 0.104 | 0.008 | 0.039 | 0.052 |
| Q013 | 0.085 | 0.130 | 0.140 | 0.125 | 0.157 | 0.147 | 0.052 | 0.219 | 0.145 | 0.212 | 0.168 | 0.211 | 0.039 | 0.310 | 1.000 | 0.210 | 0.159 | 0.215 | 0.114 | 0.362 | 0.213 | 0.210 | 0.287 | 0.177 | 0.186 | 0.199 | 0.210 | -0.122 | -0.202 | 0.112 | 0.041 | -0.181 | 0.199 | 0.012 | 0.033 | 0.077 |
| Q014 | 0.081 | 0.122 | 0.170 | 0.156 | 0.191 | 0.191 | 0.059 | 0.257 | 0.103 | 0.205 | 0.167 | 0.240 | 0.064 | 0.155 | 0.210 | 1.000 | 0.320 | 0.292 | 0.207 | 0.391 | 0.204 | 0.163 | 0.295 | 0.196 | 0.207 | 0.225 | 0.288 | -0.107 | -0.235 | 0.118 | 0.012 | -0.159 | 0.217 | 0.016 | 0.064 | 0.073 |
| Q015 | 0.044 | 0.059 | 0.090 | 0.077 | 0.102 | 0.094 | 0.043 | 0.141 | 0.102 | 0.121 | 0.093 | 0.128 | 0.079 | 0.111 | 0.159 | 0.320 | 1.000 | 0.236 | 0.328 | 0.249 | 0.121 | 0.141 | 0.201 | 0.095 | 0.100 | 0.129 | 0.098 | -0.075 | -0.111 | 0.074 | 0.015 | -0.097 | 0.096 | 0.008 | 0.029 | 0.046 |
| Q016 | 0.093 | 0.129 | 0.161 | 0.147 | 0.182 | 0.176 | 0.067 | 0.242 | 0.126 | 0.208 | 0.159 | 0.231 | 0.058 | 0.187 | 0.215 | 0.292 | 0.236 | 1.000 | 0.252 | 0.410 | 0.225 | 0.189 | 0.300 | 0.173 | 0.212 | 0.221 | 0.261 | -0.101 | -0.242 | 0.130 | 0.016 | -0.164 | 0.220 | 0.016 | 0.059 | 0.075 |
| Q017 | 0.062 | 0.077 | 0.120 | 0.094 | 0.137 | 0.124 | 0.040 | 0.180 | 0.158 | 0.168 | 0.096 | 0.131 | 0.075 | 0.116 | 0.114 | 0.207 | 0.328 | 0.252 | 1.000 | 0.234 | 0.130 | 0.144 | 0.183 | 0.067 | 0.125 | 0.152 | 0.053 | -0.055 | -0.113 | 0.108 | 0.007 | -0.084 | 0.106 | 0.010 | 0.040 | 0.042 |
| Q018 | 0.107 | 0.289 | 0.366 | 0.317 | 0.385 | 0.351 | 0.119 | 0.528 | 0.249 | 0.444 | 0.305 | 0.479 | 0.061 | 0.232 | 0.362 | 0.391 | 0.249 | 0.410 | 0.234 | 1.000 | 0.460 | 0.250 | 0.337 | 0.325 | 0.240 | 0.490 | 0.171 | -0.106 | -0.259 | 0.221 | 0.029 | -0.172 | 0.236 | 0.036 | 0.054 | 0.139 |
| Q019 | 0.131 | 0.148 | 0.197 | 0.173 | 0.227 | 0.207 | 0.082 | 0.310 | 0.223 | 0.318 | 0.240 | 0.267 | 0.038 | 0.207 | 0.213 | 0.204 | 0.121 | 0.225 | 0.130 | 0.460 | 1.000 | 0.279 | 0.413 | 0.234 | 0.263 | 0.282 | 0.231 | -0.123 | -0.219 | 0.191 | 0.032 | -0.197 | 0.231 | 0.020 | 0.084 | 0.095 |
| Q020 | 0.069 | 0.129 | 0.174 | 0.135 | 0.187 | 0.163 | 0.067 | 0.253 | 0.161 | 0.237 | 0.181 | 0.211 | 0.034 | 0.159 | 0.210 | 0.163 | 0.141 | 0.189 | 0.144 | 0.250 | 0.279 | 1.000 | 0.211 | 0.192 | 0.173 | 0.267 | 0.079 | -0.078 | -0.100 | 0.109 | 0.033 | -0.107 | 0.109 | 0.014 | 0.020 | 0.093 |
| Q021 | 0.113 | 0.197 | 0.312 | 0.278 | 0.343 | 0.319 | 0.086 | 0.450 | 0.277 | 0.382 | 0.279 | 0.359 | 0.049 | 0.207 | 0.287 | 0.295 | 0.201 | 0.300 | 0.183 | 0.337 | 0.413 | 0.211 | 1.000 | 0.292 | 0.229 | 0.382 | 0.151 | -0.121 | -0.162 | 0.206 | 0.041 | -0.175 | 0.168 | 0.021 | 0.023 | 0.148 |
| Q022 | 0.102 | 0.125 | 0.166 | 0.154 | 0.179 | 0.168 | 0.233 | 0.260 | 0.111 | 0.224 | 0.271 | 0.240 | 0.063 | 0.134 | 0.177 | 0.196 | 0.095 | 0.173 | 0.067 | 0.325 | 0.234 | 0.192 | 0.292 | 1.000 | 0.164 | 0.230 | 0.348 | -0.135 | -0.151 | 0.107 | 0.070 | -0.212 | 0.187 | 0.013 | 0.045 | 0.092 |
| Q023 | 0.044 | 0.164 | 0.202 | 0.166 | 0.216 | 0.196 | 0.066 | 0.279 | 0.148 | 0.245 | 0.172 | 0.239 | 0.068 | 0.140 | 0.186 | 0.207 | 0.100 | 0.212 | 0.125 | 0.240 | 0.263 | 0.173 | 0.229 | 0.164 | 1.000 | 0.271 | 0.105 | -0.031 | -0.123 | 0.134 | 0.024 | -0.065 | 0.133 | 0.020 | 0.033 | 0.050 |
| Q024 | 0.130 | 0.221 | 0.262 | 0.236 | 0.272 | 0.259 | 0.078 | 0.375 | 0.220 | 0.301 | 0.214 | 0.281 | 0.039 | 0.145 | 0.199 | 0.225 | 0.129 | 0.221 | 0.152 | 0.490 | 0.282 | 0.267 | 0.382 | 0.230 | 0.271 | 1.000 | 0.300 | -0.032 | -0.260 | 0.207 | 0.017 | -0.136 | 0.280 | 0.024 | 0.099 | 0.081 |
| Q025 | 0.052 | 0.188 | 0.211 | 0.212 | 0.237 | 0.244 | 0.088 | 0.308 | 0.066 | 0.247 | 0.262 | 0.260 | 0.055 | 0.184 | 0.210 | 0.288 | 0.098 | 0.261 | 0.053 | 0.171 | 0.231 | 0.079 | 0.151 | 0.348 | 0.105 | 0.300 | 1.000 | -0.042 | -0.128 | 0.084 | 0.017 | -0.094 | 0.129 | 0.016 | 0.033 | 0.062 |
| TP_ANO_CONCLUIU | -0.349 | 0.076 | -0.142 | -0.165 | -0.096 | -0.116 | -0.112 | -0.090 | -0.093 | -0.120 | -0.126 | -0.149 | -0.041 | -0.073 | -0.122 | -0.107 | -0.075 | -0.101 | -0.055 | -0.106 | -0.123 | -0.078 | -0.121 | -0.135 | -0.031 | -0.032 | -0.042 | 1.000 | 0.062 | 0.339 | 0.228 | 0.752 | 0.117 | 0.011 | 0.017 | 0.399 |
| TP_COR_RACA | -0.068 | -0.228 | -0.160 | -0.177 | -0.165 | -0.183 | 0.065 | -0.290 | -0.135 | -0.222 | -0.153 | -0.262 | 0.054 | -0.112 | -0.202 | -0.235 | -0.111 | -0.242 | -0.113 | -0.259 | -0.219 | -0.100 | -0.162 | -0.151 | -0.123 | -0.260 | -0.128 | 0.062 | 1.000 | 0.111 | 0.045 | 0.110 | 0.180 | 0.038 | 0.019 | 0.066 |
| TP_ESCOLA | 0.386 | 0.192 | 0.204 | 0.191 | 0.219 | 0.201 | 0.081 | 0.252 | 0.159 | 0.213 | 0.121 | 0.175 | 0.053 | 0.086 | 0.112 | 0.118 | 0.074 | 0.130 | 0.108 | 0.221 | 0.191 | 0.109 | 0.206 | 0.107 | 0.134 | 0.207 | 0.084 | 0.339 | 0.111 | 1.000 | 0.096 | -0.321 | 0.134 | 0.019 | 0.050 | 0.707 |
| TP_ESTADO_CIVIL | 0.084 | 0.038 | 0.077 | 0.098 | 0.043 | 0.048 | 0.051 | 0.021 | 0.017 | 0.027 | 0.051 | 0.024 | 0.012 | 0.015 | 0.041 | 0.012 | 0.015 | 0.016 | 0.007 | 0.029 | 0.032 | 0.033 | 0.041 | 0.070 | 0.024 | 0.017 | 0.017 | 0.228 | 0.045 | 0.096 | 1.000 | 0.190 | 0.085 | 0.013 | 0.018 | 0.116 |
| TP_FAIXA_ETARIA | -0.582 | -0.073 | -0.218 | -0.283 | -0.152 | -0.194 | -0.111 | -0.217 | -0.141 | -0.217 | -0.201 | -0.247 | -0.049 | -0.108 | -0.181 | -0.159 | -0.097 | -0.164 | -0.084 | -0.172 | -0.197 | -0.107 | -0.175 | -0.212 | -0.065 | -0.136 | -0.094 | 0.752 | 0.110 | -0.321 | 0.190 | 1.000 | 0.177 | 0.011 | 0.032 | 0.498 |
| TP_LINGUA | 0.109 | 0.277 | 0.260 | 0.236 | 0.267 | 0.249 | 0.073 | 0.299 | 0.140 | 0.236 | 0.153 | 0.233 | 0.072 | 0.104 | 0.199 | 0.217 | 0.096 | 0.220 | 0.106 | 0.236 | 0.231 | 0.109 | 0.168 | 0.187 | 0.133 | 0.280 | 0.129 | 0.117 | 0.180 | 0.134 | 0.085 | 0.177 | 1.000 | 0.034 | 0.098 | 0.136 |
| TP_NACIONALIDADE | 0.014 | 0.033 | 0.020 | 0.019 | 0.021 | 0.021 | 0.010 | 0.027 | 0.011 | 0.020 | 0.012 | 0.018 | 0.008 | 0.008 | 0.012 | 0.016 | 0.008 | 0.016 | 0.010 | 0.036 | 0.020 | 0.014 | 0.021 | 0.013 | 0.020 | 0.024 | 0.016 | 0.011 | 0.038 | 0.019 | 0.013 | 0.011 | 0.034 | 1.000 | 0.028 | 0.011 |
| TP_SEXO | 0.037 | 0.060 | 0.066 | 0.066 | 0.063 | 0.065 | 0.018 | 0.098 | 0.036 | 0.062 | 0.043 | 0.060 | 0.033 | 0.039 | 0.033 | 0.064 | 0.029 | 0.059 | 0.040 | 0.054 | 0.084 | 0.020 | 0.023 | 0.045 | 0.033 | 0.099 | 0.033 | 0.017 | 0.019 | 0.050 | 0.018 | 0.032 | 0.098 | 0.028 | 1.000 | 0.042 |
| TP_ST_CONCLUSAO | 1.000 | 0.064 | 0.124 | 0.134 | 0.103 | 0.107 | 0.090 | 0.111 | 0.076 | 0.107 | 0.095 | 0.113 | 0.028 | 0.052 | 0.077 | 0.073 | 0.046 | 0.075 | 0.042 | 0.139 | 0.095 | 0.093 | 0.148 | 0.092 | 0.050 | 0.081 | 0.062 | 0.399 | 0.066 | 0.707 | 0.116 | 0.498 | 0.136 | 0.011 | 0.042 | 1.000 |
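
A matrix this wide is hard to scan by eye, so it can help to list only the strongly related pairs. The following is a minimal sketch, assuming the matrix is available as a square pandas DataFrame; it copies four of the variables above, and both the helper name `strong_pairs` and the 0.5 cutoff are illustrative assumptions rather than settings taken from the report.

```python
import pandas as pd

# Four variables copied from the correlation matrix above.
names = ["IN_TREINEIRO", "TP_ANO_CONCLUIU", "TP_FAIXA_ETARIA", "TP_ST_CONCLUSAO"]
corr = pd.DataFrame(
    [
        [1.000, -0.349, -0.582, 1.000],
        [-0.349, 1.000, 0.752, 0.399],
        [-0.582, 0.752, 1.000, 0.498],
        [1.000, 0.399, 0.498, 1.000],
    ],
    index=names,
    columns=names,
)


def strong_pairs(corr, threshold=0.5):
    """Return (a, b, value) for each unordered pair with |value| >= threshold.

    The threshold is an assumed cutoff chosen for illustration.
    """
    cols = list(corr.columns)
    pairs = []
    for i, a in enumerate(cols):
        # Visit only the upper triangle so each pair appears once
        # and the diagonal (always 1.000) is skipped.
        for b in cols[i + 1:]:
            value = float(corr.loc[a, b])
            if abs(value) >= threshold:
                pairs.append((a, b, value))
    # Strongest relationships first, regardless of sign.
    return sorted(pairs, key=lambda p: -abs(p[2]))


pairs = strong_pairs(corr)
for a, b, value in pairs:
    print(f"{a} ~ {b}: {value:+.3f}")
```

Sorting by absolute value keeps strong negative relationships, such as IN_TREINEIRO against TP_FAIXA_ETARIA, next to the positive ones instead of pushing them to the bottom of the list.
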
First rows
| | TP_FAIXA_ETARIA | TP_SEXO | TP_ESTADO_CIVIL | TP_COR_RACA | TP_NACIONALIDADE | TP_ST_CONCLUSAO | TP_ANO_CONCLUIU | TP_ESCOLA | IN_TREINEIRO | TP_LINGUA | Q001 | Q002 | Q003 | Q004 | Q005 | Q006 | Q007 | Q008 | Q009 | Q010 | Q011 | Q012 | Q013 | Q014 | Q015 | Q016 | Q017 | Q018 | Q019 | Q020 | Q021 | Q022 | Q023 | Q024 | Q025 | MEDIAS |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 5 | 0 | 1 | 2 | 1 | 1 | 2 | 1 | 0 | 1 | 5 | 6 | 1 | 4 | 2 | 2 | 1 | 2 | 3 | 1 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 3 | 1 | 1 | 2 | 558.24 |
| 1 | 6 | 1 | 1 | 3 | 1 | 1 | 2 | 1 | 0 | 1 | 3 | 1 | 1 | 2 | 3 | 1 | 1 | 3 | 4 | 1 | 1 | 2 | 1 | 2 | 1 | 2 | 1 | 1 | 3 | 1 | 1 | 3 | 2 | 2 | 2 | 394.62 |
| 2 | 6 | 0 | 1 | 2 | 1 | 1 | 0 | 1 | 0 | 1 | 5 | 5 | 2 | 2 | 5 | 2 | 1 | 2 | 3 | 2 | 1 | 3 | 2 | 1 | 1 | 2 | 1 | 1 | 3 | 1 | 1 | 5 | 1 | 1 | 2 | 414.10 |
| 3 | 4 | 0 | 1 | 3 | 1 | 1 | 1 | 1 | 0 | 1 | 5 | 5 | 2 | 2 | 2 | 2 | 1 | 2 | 2 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 2 | 1 | 1 | 2 | 438.10 |
| 4 | 2 | 0 | 1 | 1 | 1 | 2 | 0 | 3 | 0 | 1 | 5 | 5 | 2 | 1 | 4 | 2 | 1 | 2 | 3 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 3 | 1 | 1 | 2 | 576.70 |
| 5 | 2 | 0 | 1 | 3 | 1 | 3 | 0 | 1 | 1 | 0 | 7 | 6 | 6 | 6 | 2 | 2 | 1 | 2 | 3 | 2 | 1 | 2 | 1 | 2 | 1 | 2 | 1 | 1 | 2 | 1 | 1 | 3 | 2 | 2 | 2 | 530.58 |
| 6 | 8 | 0 | 1 | 2 | 1 | 1 | 5 | 1 | 0 | 1 | 2 | 6 | 1 | 4 | 6 | 2 | 1 | 2 | 5 | 2 | 2 | 2 | 1 | 2 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 4 | 1 | 1 | 2 | 645.80 |
| 7 | 1 | 0 | 1 | 3 | 1 | 3 | 0 | 1 | 1 | 0 | 8 | 5 | 3 | 2 | 6 | 2 | 1 | 2 | 3 | 1 | 1 | 2 | 1 | 2 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 4 | 1 | 2 | 2 | 378.74 |
| 8 | 4 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 2 | 4 | 4 | 2 | 2 | 2 | 1 | 3 | 3 | 2 | 1 | 2 | 2 | 2 | 1 | 2 | 1 | 1 | 2 | 2 | 1 | 3 | 1 | 2 | 2 | 500.40 |
| 9 | 4 | 1 | 1 | 3 | 1 | 1 | 1 | 1 | 0 | 1 | 5 | 5 | 2 | 2 | 3 | 5 | 1 | 3 | 3 | 2 | 1 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 3 | 1 | 1 | 2 | 1 | 1 | 2 | 605.58 |
Last rows
| | TP_FAIXA_ETARIA | TP_SEXO | TP_ESTADO_CIVIL | TP_COR_RACA | TP_NACIONALIDADE | TP_ST_CONCLUSAO | TP_ANO_CONCLUIU | TP_ESCOLA | IN_TREINEIRO | TP_LINGUA | Q001 | Q002 | Q003 | Q004 | Q005 | Q006 | Q007 | Q008 | Q009 | Q010 | Q011 | Q012 | Q013 | Q014 | Q015 | Q016 | Q017 | Q018 | Q019 | Q020 | Q021 | Q022 | Q023 | Q024 | Q025 | MEDIAS |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2504004 | 12 | 1 | 1 | 3 | 1 | 1 | 16 | 1 | 0 | 0 | 3 | 4 | 6 | 2 | 3 | 7 | 1 | 2 | 3 | 2 | 1 | 2 | 2 | 2 | 1 | 2 | 1 | 2 | 5 | 2 | 2 | 4 | 2 | 3 | 2 | 599.36 |
| 2504005 | 2 | 1 | 1 | 1 | 1 | 2 | 0 | 2 | 0 | 0 | 4 | 5 | 3 | 3 | 3 | 6 | 1 | 3 | 4 | 1 | 2 | 2 | 3 | 2 | 1 | 2 | 1 | 2 | 3 | 2 | 1 | 4 | 1 | 2 | 2 | 526.38 |
| 2504006 | 6 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 2 | 5 | 1 | 1 | 5 | 4 | 1 | 2 | 5 | 1 | 1 | 2 | 1 | 2 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 3 | 1 | 1 | 2 | 533.66 |
| 2504007 | 3 | 1 | 0 | 2 | 1 | 1 | 1 | 1 | 0 | 0 | 5 | 5 | 6 | 2 | 5 | 2 | 1 | 3 | 3 | 1 | 1 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 2 | 2 | 1 | 4 | 1 | 2 | 2 | 467.20 |
| 2504008 | 3 | 0 | 1 | 1 | 1 | 2 | 0 | 2 | 0 | 0 | 5 | 6 | 3 | 4 | 4 | 6 | 1 | 2 | 3 | 1 | 1 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 5 | 1 | 2 | 2 | 515.02 |
| 2504009 | 12 | 1 | 2 | 1 | 1 | 1 | 7 | 1 | 0 | 1 | 5 | 5 | 3 | 3 | 4 | 3 | 1 | 2 | 2 | 2 | 2 | 2 | 1 | 2 | 1 | 2 | 1 | 1 | 2 | 1 | 2 | 3 | 1 | 1 | 2 | 488.40 |
| 2504010 | 11 | 0 | 1 | 2 | 1 | 1 | 11 | 1 | 0 | 1 | 8 | 3 | 6 | 3 | 1 | 4 | 1 | 2 | 2 | 1 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 2 | 1 | 2 | 2 | 617.92 |
| 2504011 | 2 | 1 | 0 | 3 | 1 | 2 | 0 | 2 | 0 | 0 | 8 | 5 | 3 | 2 | 2 | 2 | 1 | 2 | 2 | 1 | 1 | 2 | 2 | 2 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 3 | 1 | 1 | 2 | 541.22 |
| 2504012 | 11 | 0 | 1 | 1 | 1 | 1 | 11 | 1 | 0 | 0 | 3 | 2 | 2 | 2 | 3 | 2 | 1 | 2 | 3 | 2 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 2 | 2 | 2 | 1 | 3 | 1 | 1 | 1 | 507.22 |
| 2504013 | 2 | 1 | 1 | 1 | 1 | 2 | 0 | 2 | 0 | 0 | 5 | 3 | 3 | 2 | 4 | 7 | 1 | 3 | 4 | 2 | 2 | 2 | 1 | 2 | 1 | 2 | 1 | 2 | 3 | 2 | 1 | 5 | 1 | 2 | 2 | 607.06 |
Most frequently occurring
| | TP_FAIXA_ETARIA | TP_SEXO | TP_ESTADO_CIVIL | TP_COR_RACA | TP_NACIONALIDADE | TP_ST_CONCLUSAO | TP_ANO_CONCLUIU | TP_ESCOLA | IN_TREINEIRO | TP_LINGUA | Q001 | Q002 | Q003 | Q004 | Q005 | Q006 | Q007 | Q008 | Q009 | Q010 | Q011 | Q012 | Q013 | Q014 | Q015 | Q016 | Q017 | Q018 | Q019 | Q020 | Q021 | Q022 | Q023 | Q024 | Q025 | MEDIAS | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2 | 0 | 1 | 3 | 1 | 2 | 0 | 2 | 0 | 1 | 2 | 2 | 1 | 1 | 3 | 2 | 1 | 2 | 3 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 2 | 1 | 1 | 2 | 495.84 | 2 |
| 1 | 3 | 0 | 1 | 3 | 1 | 2 | 0 | 2 | 0 | 1 | 2 | 2 | 1 | 1 | 4 | 2 | 1 | 2 | 3 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 3 | 1 | 1 | 2 | 498.88 | 2 |
| 2 | 4 | 0 | 1 | 3 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 5 | 2 | 2 | 4 | 2 | 1 | 2 | 2 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 3 | 1 | 1 | 2 | 578.24 | 2 |
| 3 | 4 | 0 | 1 | 3 | 1 | 1 | 1 | 1 | 0 | 1 | 2 | 2 | 1 | 1 | 4 | 2 | 1 | 2 | 3 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 2 | 1 | 1 | 2 | 561.40 | 2 |
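
A table like the one above can be approximated directly in pandas. This is a minimal sketch on a toy frame that keeps only three of the 36 columns, with values copied from the rows shown in this report; the variable names `dup_counts` and `n_extra` are illustrative, and the snippet does not claim to reproduce how the report itself counts duplicate rows.

```python
import pandas as pd

# Toy frame: three columns only, values copied from rows shown in this report.
df = pd.DataFrame(
    {
        "TP_FAIXA_ETARIA": [2, 5, 3, 2, 3],
        "TP_SEXO": [0, 0, 0, 0, 0],
        "MEDIAS": [495.84, 558.24, 498.88, 495.84, 498.88],
    }
)

# keep=False marks every member of a duplicated group, not just the repeats.
dup_counts = (
    df[df.duplicated(keep=False)]
    .groupby(list(df.columns))
    .size()
    .reset_index(name="# duplicates")
)

# The default keep="first" counts only the surplus copies.
n_extra = int(df.duplicated().sum())

print(dup_counts)
print("surplus copies:", n_extra)
```

On the full dataset, `df.drop_duplicates()` would remove the surplus copies while keeping one row from each group.
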